Skip to content

Conversation

@shivakunv
Copy link
Contributor

No description provided.

@shivakunv shivakunv self-assigned this Dec 2, 2025
@shivakunv shivakunv force-pushed the supportrhel10 branch 2 times, most recently from 9a3e0d9 to 3820eed Compare December 12, 2025 12:25
@shivakunv shivakunv force-pushed the supportrhel10 branch 6 times, most recently from 38cebfe to 56f469c Compare December 15, 2025 07:38
Copy link
Contributor Author

@shivakunv shivakunv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

help needed here.
Please check which base image and driver version to use for RHEL10 support .
@rajathagasthya @tariq1890 @cdesiniotis

@shivakunv shivakunv force-pushed the supportrhel10 branch 4 times, most recently from 70e7101 to 6e35a8e Compare December 16, 2025 16:09
@shivakunv
Copy link
Contributor Author

Please find addtional information regarding extrapackage and driver version availability:-

dnf list available nvidia-driver-580*
nvidia-driver.aarch64                                                                                                                                                       3:580.105.08-1.el10                                                                                                                                                       cuda              
nvidia-driver.x86_64                                                                                                                                                        3:580.105.08-1.el10                            
dnf list available nvidia-driver-590*
nvidia-driver-assistant.noarch                                                                                                                                                   0.23.44.01-1                                                                                                                                                         cuda              
nvidia-driver-assistant.noarch                                                                                                                                                   0.23.44.01-1                                                                                                                                                         cuda-rhel10-x86_64
nvidia-driver-cuda.aarch64                                                                                                                                                       3:590.44.01-1.el10                                                                                                                                                   cuda              
nvidia-driver-cuda.x86_64                                                                                                                                                        3:590.44.01-1.el10                                                                                                                                                   cuda-rhel10-x86_64
nvidia-driver-cuda-libs.aarch64                                                                                                                                                  3:590.44.01-1.el10                                                                                                                                                   cuda              
nvidia-driver-cuda-libs.x86_64                                                                                                                                                   3:590.44.01-1.el10                                                                                                                                                   cuda-rhel10-x86_64
nvidia-driver-libs.aarch64                                                                                                                                                       3:590.44.01-1.el10                                                                                                                                                   cuda              
nvidia-driver-libs.x86_64                                                                                                                                                        3:590.44.01-1.el10                                                                                                                                                   cuda-rhel10-x86_64
 dnf list available nvidia-fabric*
 Available Packages
nvidia-fabric-manager-devel.aarch64                                                                                                                                                 590.44.01-1.el10                                                                                                                                                  cuda              
nvidia-fabric-manager-devel.x86_64                                                                                                                                                  590.44.01-1.el10                                                                                                                                                  cuda-rhel10-x86_64
nvidia-fabricmanager.aarch64                                                                                                                                                        590.44.01-1.el10                                                                                                                                                  cuda              
nvidia-fabricmanager.x86_64                                                                                                                                                         590.44.01-1.el10                                                                                                                                                  cuda-rhel10-x86_64
nvidia-fabricmanager-devel.aarch64                                                                                                                                                  580.65.06-1                                                                                                                                                       cuda              
nvidia-fabricmanager-devel.x86_64                                                                                                                                                   580.65.06-1                                                                                                                                                       cuda-rhel10-x86_64
dnf list available libnvidia-nscq*
Available Packages
libnvidia-nscq.aarch64                                                                                                                                                        590.44.01-1.el10                                                                                                                                                        cuda              
libnvidia-nscq.x86_64                                                                                                                                                         590.44.01-1.el10                                                                                                                                                        cuda-rhel10-x86_64
dnf list available libnvsdm*
Available Packages
libnvsdm.x86_64                                                                                                                                                              590.44.01-1.el10                                                                                                                                                         cuda-rhel10-x86_64
libnvsdm-devel.x86_64                                                                                                                                                        590.44.01-1.el10                                                                                                                                                         cuda-rhel10-x86_64
dnf list available infiniband-diags*
Available Packages
infiniband-diags.aarch64                                                                                                                                                  57.0-2.el10                                                                                                                                                  ubi-10-for-aarch64-appstream-rpms
dnf list available nvidia-imex* 
Available Packages
nvidia-imex.aarch64                                                                                                                                                         590.44.01-1.el10                                                                                                                                                          cuda              
nvidia-imex.x86_64                                                                                                                                                          590.44.01-1.el10                

used gunzip instead of unzboot for as it is not available for rhel10

@shivakunv shivakunv force-pushed the supportrhel10 branch 4 times, most recently from 5de7278 to b460e52 Compare January 14, 2026 12:47
@shivakunv
Copy link
Contributor Author

Copy link
Contributor Author

@shivakunv shivakunv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added comments regarding some minor changes that differ from rhel9.


extra_pkgs_install() {
if [ "$DRIVER_TYPE" != "vgpu" ]; then
if dnf module list nvidia-driver:${DRIVER_BRANCH}-dkms 2>/dev/null | grep -q "nvidia-driver"; then
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I added a condition to check for the availability of dkms, as it is currently not available for 580 driver

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I prefer a check like this if [ "$DRIVER_BRANCH" -ge "580" ]; then for these conditional blocks. Can you find out which driver branches don't have this particular dnf module ?

echo "Installing Linux kernel headers..."
# Check if kernel headers are already available (mounted from host)
if [ -d "/usr/src/kernels/${KERNEL_VERSION}" ]; then
echo "Kernel headers for ${KERNEL_VERSION} already present at /usr/src/kernels/${KERNEL_VERSION}"
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added a condition for installing the kernel headers.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this change strictly needed for RHEL 10? If this is a cleanup or enhancement, let's do it in a follow-up instead

ln -s /usr/src/kernels/${KERNEL_VERSION} /lib/modules/${KERNEL_VERSION}/build

echo "Installing Linux kernel module files..."
if ! dnf -q -y --releasever=${DNF_RELEASEVER} install kernel-${KERNEL_VERSION} > /dev/null 2>&1; then
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have added a condition for installing the kernel headers.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this change strictly needed for RHEL 10? If this is a cleanup or enhancement, let's do it in a follow-up instead

echo "kernel requires gcc version: 'gcc-${gcc_version}', current gcc version is '${current_gcc}'"

if ! [[ "${current_gcc}" =~ "gcc-${gcc_version}"-.* ]]; then
echo "WARNING: GCC version mismatch detected, but attempting to continue..."
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

gcc mismatch handled gracefully

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this change strictly needed for RHEL 10? If this is a cleanup or enhancement, let's do it in a follow-up instead

Signed-off-by: Shiva Kumar (SW-CLOUD) <[email protected]>

extra_pkgs_install() {
if [ "$DRIVER_TYPE" != "vgpu" ]; then
if dnf module list nvidia-driver:${DRIVER_BRANCH}-dkms 2>/dev/null | grep -q "nvidia-driver"; then
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I prefer a check like this if [ "$DRIVER_BRANCH" -ge "580" ]; then for these conditional blocks. Can you find out which driver branches don't have this particular dnf module ?

# Download unzboot as kernel images are compressed in the zboot format on RHEL 9 arm64
# unzboot is only available on the EPEL RPM repo
rpm --import https://dl.fedoraproject.org/pub/epel/RPM-GPG-KEY-EPEL-9
dnf install -y https://dl.fedoraproject.org/pub/epel/epel-release-latest-9.noarch.rpm
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we should be using EPEL 10, not 9. If dnf install -y unzboot does work in EPEL 10, then there is probably another command to install unzboot in RHEL 10 + EPEL 10 envs

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using references like this will help you find out the correct package repos and installation commands

Shiva Kumar added 4 commits January 16, 2026 18:10
Signed-off-by: Shiva Kumar (SW-CLOUD) <[email protected]>
Signed-off-by: Shiva Kumar (SW-CLOUD) <[email protected]>
Signed-off-by: Shiva Kumar (SW-CLOUD) <[email protected]>
Signed-off-by: Shiva Kumar (SW-CLOUD) <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants